Matrix Graph Grammars as a Model of Computation 



Pedro Pablo Perez Velasco 

Escuela Politecnica Superior 
Universidad Autonoma de Madrid 

pedro.perez@uam.es 



Abstract. Matrix Graph Grammars (MGG) is a novel approach to the study of 
graph dynamics (| 16 1). In the present contribution we look at MGG as a formal 
grammar and as a model of computation, which is a necessary step in the more 
ambitious program of tackling complexity theory through MGG. We also study 
its relation with other well-known models such as Turing machines (TM) and 
Boolean circuits (BC) as well as non-determinism. As a side effect, all techniques 
available for MGG can be applied to TMs and BCs. 
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1 Introduction 

Graph transformation M4I20II is becoming increasingly popular in order to describe sys- 
tem behavior due to its graphical, declarative and formal nature. It is central to many 
application areas, such as visual languages, visual simulation, picture processing and 
model transformation (see H and ll20l Vol.2 for some applications). Also, graph rewrit- 
ing techniques have proved useful in describing Domain Specific Language^ and in 
language-oriented programming (see |8|). 

Matrix Graph Grammars (MGG) is a purely algebraic approach to graph rewriting 
(graph dynamics) that has successfully solved or extended problems and results such as 
sequentialization, explicit parallelism, applicability, congruence characterization, initial 
state calculation (initial digraphs), constraints (application conditions) and reachability. 
See for example f 1 2I13I14I15I18I191 or the more comprehensive introduction fTSJ. 

It seems natural to study MGG as a model of computation that would describe the 
state of some system by means of graphs. To the very best of the author's knowledge, 
there has been no attempt in this direction until now, despite the agreed interest of the 
topic (see e.g. [9 10|). Even more so, there seems to be no graph rewriting literature on 
non-determinism or, more generally, on complexity. 

There are two main handicaps that may partially explain the lack of theoretical 
results. First, graph rewriting systems (MGG in particular) would make use of the func- 
tional problem known as subgraph matching. Its associated decision problem {subgraph 



' Following (3), a domain-specific language (DSL) is a programming language or executable 
specification language that offers, through appropriate notations and abstractions, expressive 
power focused on, and usually restricted to, a particular problem domain. 



isomorphism) has been proved to be NP-complete (||6|). Even worse, subgraph match- 
ing would be used in every single step of the computation. Second, there are two ap- 
parently hard-to-avoid sources of non-determinism: what production rule to apply and 
where to apply it. 

The author's main motivation for the development of MGG is the study of complex- 
ity theory - complexity classes P and NP in particular - which has led the research in 
MGG up to nowo The present contribution is a necessary step in this program. 

Due to their current theoretical and practical relevance, we have decided to study 
Turing machines (TM) and Boolean Circuits (BC) and compare them to MGG. As a 
side effect, all techniques developed for MGG become available to study TMs and BCs. 
Currently we are capable to (partially) answer the following questions within MGG: 

1. Coherence: are the actions specified by the productions (inside a sequence) com- 
patible with each other? 

2. Initial state: calculate an initial configuration (initial grammar state) such that some 
given sequence can be applied. 

3. Congruence: do a sequence and a permutation of it have a common initial state? 

4. Sequential independence: is it possible to advance/delay some production a finite 
number of positions inside a given sequence? 

5. Reachability: for a given grammar and initial and final states, find a sequenc^that 
transforms the initial state into the final state. 

Paper Organization. Section |2] is an overview of the very basics of MGG and briefly 
explains some of the analysis techniques developed so far. Section |3] characterizes re- 
labeling in MGG. Section |4] and |5] study MGG as a formal grammar and as a model of 
computation, respectively, touching on some possible submodels and supermodels of 
computation that can be defined from MGG. Determinism is addressed in Secs.|6]and|7] 
in which we move from Boolean algebra to an algebra of matrices. In Sees. |8]and|9] 
MGG models Turing machines and Boolean Circuits, respectively. Section [TO] closes 
this paper with a short summary and some proposals for future research. 
Notation. The matrix whose entries are all zero will be represented with a bolded zero, 
0. Similarly, the matrix in which every single element is a one will be represented with 
a bolded one, 1. 

2 Matrix Graph Grammars: Basics 

In this section we give a brief overview of some of the basics of Matrix Graph Gram- 
mars (MGG) with examples as intuitive as possible. For a detailed account and acces- 
sible presentation the reader is referred to L16J . 

Simple Digraphs. We work with simple digraphs, which we represent as (M, V) where 

^ Almost all concepts and problems addressed in |16 | directly study sequentialization. Recall 
that P, NP and in general the classes in VTl (the polynomial hierarchy) encode sequentializa- 
tion. 

^ ... or provide information about it, e.g. what production rules have to be applied and how 
many times each production should be applied. 



M is a Boolean matrix for edges (the graph adjacency matrix) and V a Boolean vector 
for vertices or nodes. Note that we explicitly represent the nodes of the graph with a 
vector This is necessary because it is possible within MGG to add and delete nodes. 

The existing nodes are marked with a 1 in the corresponding position of the vec- 
tor. Figure [TJa) shows a graph representing a production system made of a machine 
(controlled by an operator) which consumes and produces pieces through conveyors. 
Generators create pieces in conveyors. Self loops in operators and machines indicate 
that they are busy. 
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Fig. 1. Simple Digraph Example (left). Matrix Representation with Labels (right) 



Note that the matrix and the vector in the figure are the smallest ones able to repre- 
sent the graph. Adding 0-elements to the vector (and accordingly 0-rows and columns 
to the matrix) would result in equivalent graphs. Next definition formulates the repre- 
sentation of simple digraphs. 

Definition 1 (Simple Digrapli Representation). A simple digraph G is represented by 
G = {M, V) where M is the graph adjacency matrix and V is the Boolean vector of 
its nodes. 

Compatibility. Well-formedness of graphs (i.e. absence of dangling edges) can be 
checked by verifying the identity ||(M V M*) — 0, where is the Boolean 
matrix product (like the regular matrix product, but with and and or instead of multi- 
plication and addition), Af * is the transpose of the matrix M, V is the negation of the 
nodes vector V, and || • ||i is an operation (a norm, actually) that results in the or of 
all the components of the vector. This property is called compatibility in 1 12|. Note that 
M QV results in a vector that contains a 1 in position i when there is an outgoing edge 
from node i to a non-existing node. A similar expression with the transpose of M is 
used to check for incoming edges. A simple digraph G = (M, V) is compatible if and 
only if 

||(MVM*) 0F||^ =0. (1) 

Compatibility of productions guarantees that the image of a simple digraph is a sim- 
ple digraph again. It is useful to check closedness of the space (simple digraphs) under 
the specified operations (grammar rules). 

Labeling. A label (also known as a type) is assigned to each node in G = (A/, V) 
by a function from the set of nodes F to a set of labels T, \ : V ^ T. In Fig. [T] 



labels are represented as an extra column in the matrices, the numbers before the colon 
distinguish elements of the same type. 

There are several equivalent possibilities to label edges. We may use the types of 
their source and target nodes. Another possibility is to choose the set of edges E{G) as 
domain for A (see Def.|2 1 instead of V. Notice that this would define A for every element 
in the adjacency matrixo Yet another possibility is to define labels just on nodes and 
split one edge N1N2 (the one joining node iVi to node A'2) into two edges, the first 
starting in node A^i and ending in a "labeling node" and the second starting in this same 
"labeling node" and ending in A^2- 

Definition 2 (Labeled Simple Digraph). A labeled simple digraph G\ = (G, A) = 
((M, V), A) over a set of labels T is made of a simple digraph G plus a function 
A : y — > T whose domain is the set of nodes of G. 

By abuse of notation, the subscript A will normally be omitted. Next we define the 
notion of partial morphism between labeled simple digraphs. 



Definition 3 (Labeled Simple Digraph Morphism). Given two simple digraphs Gi = 
((Mi, Vi), \i :Vi ^ T) fori = {1, 2}, a morphism f = (/^, /^) : Gi ^ G2 is made 
of two partial injective functions : Vi V2, '■ Mi — > M2 between the set of 
nodes and edges, such that: 

1. Vv e Domif''), Ai(«) = A2 (/^(«)) 

2. V(n,m) e Domif'^), f'^{n,m) = {f^{n)J^{m)), 

where Dom(f) is the domain of the partial function f, E stands for edges and V for 
vertices. 



Productions. A production, p : L ^ i? is a morphism of labeled simple digraphs. 
Using a static formulation, a rule is represented by two labeled simple digraphs that 
encode the left and right hand sides (LHS and RHS, respectively). The matrices and 
vectors of these graphs are arranged so that the elements identified by morphism p 
match (this is called completion, see below). 

Definition 4 (Static Formulation of Productions). A production p : L ^ Ris stati- 
cally represented as 

p = (L, R) = {{L^, L^, Xl), (i?^, i?^, Xr)) . (2) 

A production adds and deletes nodes and edges, therefore using a dynamic formu- 
lation, we can encode the rule's pre-condition (its LHS) together with matrices and 
vectors representing the addition and deletion of edges and nodes. We call such matri- 
ces and vectors e for "erase" and r for "restock". 



Similarly, labels on the nodes would be equivalent to defining A for the elements in the main 
diagonal of the adjacency matrix. 



Definition 5 (Dynamic Formulation of Productions). A production p : L ^ R is 
dynamically represented as 

p = {L, e, r) = {{L^, L^, Al) , (e^, e^, A,) , (r^, r^, A.)) , (3) 

where and are the deletion Boolean matrix and vector ( respectively), and 
are the addition Boolean matrix and vector ( respectively). They have a 1 in the position 
where the element is to be deleted (for e) or added (for r). 

The output of rule p - where the and symbol A is omitted to simplify formula^ - 
is calculated by the Boolean formula 

R = p{L) = rVeL, (4) 

which applies to both, edges and nodes. 

Example. DFigurelDshows a rule and its associated matrices. The rule models the con- 
sumption of a piece by a machine. Compatibility of the resulting graph must be ensured, 
thus the rule cannot be applied if the machine is already busy, as it would end up with 
two self loops, which is not allowed in a simple digraph. This restriction of simple 
digraphs can be useful in this kind of situations, and acts like a built-in negative appli- 
cation condition (refer to [161, Chaps. 8 and 9). Later we will see that the nihilation 
matrix takes care of this restriction. 
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Fig. 2. (a) Rule Example, (b) Static Formulation, (c) Dynamic Formulation. 



Completion. In order to operate graphs of different sizes, an operation called comple- 
tion adds extra rows and columns with zeros (to matrices and vectors) and rearranges 
rows and columns so that the identified edges and nodes of the two graphs match. For 
example, in Fig.|2l if and R^ need to be operated, completion adds a fourth 0-row 
and fourth 0-column to R^. 

Otherwise stated, whenever we have to operate graphs Gi and G2, an implicit mor- 
phism f : Gi ^ G2 has to be established. This morphism is completion, which rear- 
ranges the matrices and the vectors of both graphs so that the elements match. In the 



^ In order to avoid ambiguity we shall state that A has precedence over V. 



examples, we omit such operation, and assume that matrices are completed when nec- 
essary. 



Nihilation Matrix. In order to consider the elements in the host graph that disable a 
rule application, we extend the notation for rules with a new graph K. Its associated 
matrix specifies the two kinds of forbidden edges: those incident to nodes which are 
going to be erased and any edge added by the rule (which cannot be added twice, since 
we are deahng with simple digraphs). Notice however that K considers only potential 
danghng edges with source and target in the nodes belonging to . 

Definition 6 (Niliilation Matrix). Given the production p = (L, e, r), its nihilation 
matrix contains non-zero elements in positions corresponding to newly added 
edges, and to non-deleted edges adjacent to deleted nodes. 

We extend the rule formulation with this nihilation matrix. The concept of produc- 
tion remains unaltered because we are just making explicit some implicit information. 
Matrices are derived in the following order: {L,R) i— > (e,r) i-^ K. Thus, a rule is 
statically determined by its LHS and RHS p = {L, R), from which it is possible to give 
a dynamic definition p ~ {L,e,r), with e = LR and r = RL, to end up with a full 
specification including its environmental behavior p = {L, K, e,r). No extra effort is 
needed from the grammar designer because K can be automatically calculated as the 
image by the rule p of a certain matrix (see below). 

Notice that the nihilation matrix K for a production p has an associated simple 
digraph whose nodes coincide with those of L as well as its labels. 

Definition 7 (Dynamic Reformulation of Production). A production p : L ^ R is 
dynamically represented as 

p = {L,K,e,r,X) = 
= {{L^, L^, X,) , {K^, K^, Xk) , (e^, e^, A^) , (r^, r^, A.)) , (5) 

where L is the LHS, K is the labeled simple digraph associated to the nihilation matrix, 
e is the deletion matrix and r is the addition matrix. 

The nihilation matrix of a given production p is given by 

K^=p(jy) with D = e^(E)e^\ (6) 

where (g) denotes the tensor (or Kronecker) product, which sums up the covariant and 
contravariant parts and multiplies every element of the first vector by the whole second 
vector. See |fT9l for a proof. Notice that matrix D specifies potential dangling edges 
incident to nodes in p's LHS: 

D = ce = l^ (e^' = 1 or (e^)^ = 1. 

^ 1 otherwise. 

Example. DThe nihilation matrix for the example rule of Fig. |2]is calculated as 
follows: 
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The nihilation matrix is then given by : 
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K = r\JeD^ 
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The matrix indicates any danghng edge from the deleted piece (the edge to the 
conveyor is not signaled as it is explicitly deleted) as well as self-loops in the machine 
and in the operator. ■ 
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Fig. 3. K Graph for startProcess 



The evolution of the rule LHS (i.e. how it is transformed into the RHS) is given 
by the production itself: R = p{L) = r V eL. See equation dUi. It is interesting to 
analyze the behavior of the nihilation matrix, which is given in the next proposition. 
Let p : L ^ Rhe a compatible production with nihilation graph K. Then, the elements 
that must not appear (Q) once the production is applied (see [19] for a proof) are given 
by 

Q = p-'^{K) = e\JrK^. (7) 
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Fig. 4. Evolution of the Nihilation Matrix 



Example.DFigurell] shows the calculation of startProcess ^{K^) using the graph 
representation of the matrices in eq. (|7]i- B 



The dual concept T specifies the newly available edges after the application of a 
production due to the addition of nodes: 

T = {r ® r*^ A (e (g) e*) . (8) 

Refer to [TSl] where T was introduced and studied. 

Next definition introduces a functional notation for rules (already used in ifTSll ') in- 
spired by the Dirac or bra-ket notation |[T] . 

Definition 8 (Functional Formulation of Production). A production p : L ^ R can 
be depicted as R — p{L) — {L,p), splitting the static part (initial state, L) from the 
dynamics (element addition and deletion, p). 

Using such formulation, the ket operators (i.e. those to the right of the bra-ket) can 
be moved to the bra (i.e. left hand side) by using their adjoints (which are usually dec- 
orated with an asterisk). 

Match and Derivations. Matching is the operation of identifying the LHS of a rule 
inside a host graph (we consider only injective matches). Given the rule p : L ^ R and 
a simple digraph G, any m : L ^ G total injective morphism is a match for p in G. 
The following definition considers not only the elements that should be present in the 
host graph G (those in L) but also those that should not be (those in K). 

Definition 9 (Direct Derivation). Given the rule p : L R and the graph G = 
{G^,G^) as in Fig. ^a), d — {p,m) - with m — (rnL,m^^ - is called a direct 
derivation with result H — p* (G) if the following conditions are fulfilled: 

1. There exist m.^ : L G and rn|| ; — )■ G^ total injective morphisms. 

2. rriLin) — mf^{n), Vri G . 

3. The match ttil induces a completion of L in G. Matrices e and r are then completed 
in the same way to yield e* and r*. The output is given by H = p* (G) = r* V e*G. 
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Fig. 5. (a) Direct Derivation, (b) Example 



Remark.DThe square in Fig.|5](a) is a categorical pushout. Item 2 is needed to ensure 
that L and are matched to the same nodes in G. ■ 
Example. DFigurelSjb) shows the appHcation of rule startProcess to graph G. We have 
also depicted the inclusion of in G^ (bidirectional arrows have been used for sim- 
plification). G^ is the complement (negation) of matrix G^ . ■ 

It is useful to consider the structure defined by the negation of the host graph, G = 
{G^,G^). It is made up of the graph G^ and the vector of nodes G^ . Note that the 
negation of a graph is not a graph because in general compatibility fails, that is why the 
term "structure" is used. 

The complement of a graph coincides with the negation of the adjacency matrix, 
but while negation is just the logical operation, taking the complement means that a 
completion has been performed in advance. That is, the complement of graph G with 
respect to graph A, through a morphism / : A ^ G is a two-step operation: (i) complete 
G and A according to /, yielding G' and A'; (ii) negate G'. As long as no confusion 
arises negation and complements will not be distinguished syntactically. 
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Fig. 6. Finding Complement and Negation of a Graph. 



Examples. DSuppose we have two graphs A and G as those depicted in Fig.|6]and that 
we want to check that A is not in G. Note that A is not contained in G (an operator 
node does not even appear) but it does appear in the negation of the completion of G 
with respect to A (graph Ga in the same figure). 

In the context of Fig.|3b) we see that there is an inclusion startProcess~^ (K^) 
H (i.e. the forbidden elements after applying production startProcess are not in H). 
This is so because we complete H with an additional piece (which was deleted from 
G). ■ 

Analysis Techniques. In II12I13I14I15I161 we developed some analysis techniques, 
mainly to study sequences in MGG. We end this section with a short summary of these 
techniques with the exception of application conditions, graph constraints and explicit 
parallelism. 

One of the goals of our previous work was to analyze rule sequences independently 
of a host graph. We represent a rule sequence as s„ = p„; where application is 

from right to left (i.e. pi is applied first). For its analysis, we complete the sequence 
by identifying the nodes across rules which are assumed to be mapped to the same 
node in the host graph. Mind the non-commutativity of this operation and its potential 
non-determinism. 

Once the sequence is completed, our notion of sequence coherence |12 | allows to 
know if, for the given identification, the sequence is potentially applicable (i.e. if no rule 



disturbs the application of those following it). The formula for coherence results in a 
matrix and a vector (which can be interpreted as a graph) with the problematic elements. 
If the sequence is coherent, both should be zero, if not, they contain the problematic 
elements. We shall elaborate on this in Secs.|6]and|71 

A coherent sequence is compatible if its application produces a simple digraph. 
That is, no dangling edges are produced in intermediate steps. This extends the notions 
of compatible graph and compatible production. 

Given a completed sequence, the minimal initial digraph (MID) is the smallest graph 
that allows applying such sequence. Conversely, the negative initial digraph (NID) con- 
tains all elements that should not be present in the host graph for the sequence to be 
appUcable. In this way, the NID is a graph that should be found in G for the sequence 
to be applicable (i.e. none of its edges can be found in G). We shall elaborate on this in 
Sec.|7l 

We shall not touch on other concepts such as G-congruence and reachability. No- 
tice that reachability can be thought of as an initial attempt to measure the number of 
elements in a sequence, a tool that might be useful to provide lower bounds. Similarly 
to what is done in the present contribution for Turing Machines and Boolean Circuits, 
Petri nets can be modeled with Matrix Graph Grammars. See ifTSl and lfT6l . 

3 Relabeling 

Completion as introduced in Sec. |2] performs two tasks: enlarges matrices by adding 
rows and columns of the appropriate type and rearranges them (to get a coincidence 
according to the identifications across productions). The first task - as we shall see 
in Sec. |4]- has to do with the dimension of the underlying algebraic structure. The 
second is directly related to non-determinism, for which we need to give an operational 
definition. The section is dedicated to this topic. 

Relabeling is just a permutation on nodes. A simple observation is that to any per- 
mutation (T e Sn, a permutation Al matrix a can be associated with cr* = 1 cr{i) = 
j. Notice that a Boolean matrix is a permutation matrix with respect to the action de- 
fined in eq. (|9|l if and only if it has a single 1 per row and column. Its action p is defined 
by 

p^{L) ^ (t{L) = a. L = (T Q L Q a\ (9) 

where is the matrix product but with or operations instead of sums and with and 
instead of multiplication, and L is the adjacency matrix of some simple digraph. For a 
quick introduction on permutation matrices, please refer to [24J . We shall also use • in 
place of 0. 

As a matter of fact, eq. ® defines a production as it transforms one simple digraph 
into another simple digraph|j As such, it can be expressed in terms of some appropriate 
e and r matrices: 

e = L(^) =L(^cr-L-cr*) , r^L{a.L)^L{a-L-a*). (10) 

* By abuse of notation we shall represent the permutation, its associated matrix and the relabel- 
ing production that it defines with the same letter a. 



Recall that an elementwise and is assumed when the operation is omitted. If A and B 
are n x n Boolean matrices then AB = 
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Fig. 7. Example of Permutation (Left) and its Associated Production (Right) 



Example. DSuppose we are given the permutation ct = (1 2 3) G S'3 to be applied to 
the graph L, depicted to the right of Fig. |7] The associated permutation matrix is 

1 

1 

1 

Its action is equivalent to a node relabeling where node 1 plays the role of node 2, 
node 2 that of node 3 and node 3 becomes node 1. If we wanted to put cr as an (e, r) 
production, its erasing and addition matrices would be: 
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which have been calculated using eq. ( flOl l. 

Say we need to relabel node 3 in Fig. |2]with a new type, for example 4. It would 
be possible to proceed in three stages: first add node 4, then permute nodes 3 and 4 
applying the 2-cycle 0-34 = (3 4)- its graph will have nodes 1 and 2 with self edges - 
and then delete node 3. ■ 

Despite the possibility of reducing to some (e, r) productions, we shall introduce 
this new operation, extending the notion of production to that of affine production. The 
main reason is that, according to eq. dTol l. the relabeling expressed in terms of (e, r) 
(as a production) would have different matrices depending on the graph it is applied to. 
The action of relabeling should have associated a single element, independently of the 
simple digraph it acts on. 

Through concatenation we obtain the two possible combinations of operations p; a 
and (t; p, which are respectively: 



P;(t{L) = r\/e{a ■ L ■ a*^) 
cj;p{L) = a ■ (rWeL) ■ a* 

We shall be more interested in the combination of both a;p; a 
apply production and permute the result. It is readily seen that 



(11) 
(12) 

permute the graph, 



p^iL) = cr;p; a{L) = a ■ [r \J e {a ■ L ■ ct*)] • ct* = [cr • (r V e) • cr*] L, (13) 



which defines the action of eq. ^ for productions. Equation ( fT3l l can be rewritten 

p-{L) = {L,a.p). 



Definition 10 (Afflne Production). A production p : L ^ R as defined by eq. ( UH ) is 

dynamically represented by 

p={L,K,e,r,a,\) (14) 

where L is the LHS, K is the labeled simple digraph associated to the nihilation matrix, 
e is the deletion matrix, r is the addition matrix, a is a relabeling and A is the node 
labeling mapping. 

Clearly, making a the identity returns an (e, r) production (which will be known 
as the traslational part of the affine production) and setting e = r = transforms the 
production in a pure relabeling action (which will be known as the rotational part of 
the affine production). 

Proposition 1. The action on productions Pa{p) = p'^ is an homomorphism when the 
concatenation operation is considered. 

Proof 

□We have to check the following three properties: 

1. PaW = 1. 

2. Pa{p2\Pi) = ct;p2;Pi;o- = a;p2\<7;a\pi\<7 = pa{p2); Pa{Pl)- 

'i- Pa {p'^) = <j;p~^;cr = {a-^;p;a-^) ^ {a;p; = [p^ (p)]^'^ . ■ 

4 Formal Grammar 

In this section we give a definition of MGG as a formal grammar and postpone its study 
as a model of computation to the next section. 

In this author's opinion, the main drawback to approach the PvsNP problem (com- 
plexity theory in general) is the lack of branches of mathematics available|Z| The mo- 
tivation to introduce MGG as a formal grammar is to pose an algebraic approach to 
complexity theory. This is done prior to its definition as a model of computation to 
make the link more evident. 

The exposition moves from the abstract to the concrete. Recall that a formal gram- 
mar is the quad-tuple {Af, S^V , Sq), being M a finite set of non-terminal symbols, S 
a finite set of terminal symbols [M fl i7 = 0), 7-" a finite set of production rules and 
sq E J\f the start symbol. Production rules have the general form 

{SUAf)*Af{UUj^)* ^ {SUj^y , (15) 

where * is the Kleene star operator]! See for example I?]. Equation ( fTSl l demands at 
least one non-terminal symbol on the LHS. The operations on chains of symbols are 

^ An interesting exception being that of GCT by Mulmuley and Sohoni ( II II ). 
* If V is a set then V is the smallest superset that contains the empty string and is closed under 
the string concatenation operation. 



concatenation and non-terminal symbol replacement, the latter defined through produc- 
tion rules. We shall dedicate the rest of the section to these two operations. 

We shall start with the set of rt x n matrices over the field Z2[i], Af„ (Z2[i]). An 
element z — x + iy E l,2[i] is such that x,y £ {0, 1}. We shall consider the following 
operations over M„ (Z2 [i]): 

1. Multiplication by scalars. Represented by •. As we are in Z2, this operation either 
lets the element unaltered or transforms it into the zero matrix 0. 

2. Matrix addition. Represented by + and defined in the usual way. 

3. Matrix multiplication. Represented by or by • if no confusion with the multi- 
plication by scalars may rise. Defined as usual, with addition and multipUcatior0 
carried out over Z2 . 

4. Pointwise multiplication. Represented by A (omitted by default). Let be given C,TZ £ 
M„ (Z2 with C = L + iK andn = R + iQ, then 

Chn = Cn={LR + KQ) + i{LQ + KR) (16) 

where XY is the elementwise multiplication, XY = {x^mVrnjj m£{i n}' 

5. Scalar product. Let Cj — Lj + iKj E Mn (Z2[i]), then the scalar product is 
represented by {Ci,£2) and defined if 

(Li+i^i)(l + L2 + i^2) = 0. (17) 

This condition guarantees that £1, £2 G (Z2[i]) =^ (£i,£2) GAf„(Z2[i]). 
See the proof of Prop. |2]on p fT4l 

As we shall be almost exclusively interested in those elements that have disjoint real 
and imaginary parts, let's introduce 

M" = {£ = L + iK e Mn (Z2W) I LA' = 0} . (18) 

Elements of M" will be known as Boolean complex matrices. Let T be some finite 
set and for g E M" consider the mapping \g : g T defined only for elements in the 
diagonal of L and K and such that \{L) = X{K). The meaning of A will be clarified in 
Sec.|5] Its subscript will be usually omitted. 

Except for some restrictions on the operations - their concrete definition taking A 
into account are given in eqs. ( |22] |. (l23T l and (l24l i below - the (vector) space on which 
we shall focus our attention is (M", +, • , 0), where 

M" = = (5, A) .g e M" and A as above | . (19) 

For MGG, the quad- tuple — {Af, S,V ^sq) that defines the formal grammar has 
the following elements: 



' In MGG, the matrix multiplication uses and and or. The and operation coincides with the 
multiplication over Z2. It is not difficult to see that the or operation can be replaced by the xor 
as long as one of the matrices involved is a permutation matrix. The equality p\/ q = p+pq + q 
in propositional logics can be used together with the fact that all elements but one in a row 
(column) are zero. 



- J\f — {£i I 3pi : Ci ^ TZi, Pi & V}, the finite set of elements in M" that appears 
in the LHS of some production rule. 

- r = {£ e M" I £ ^ A/"}, the finite set of elements in M" not belonging to TV. 
-V ^ {p : C^H\L&N, TleSyj TV}, the finite set of production rules as de- 
fined in eq. ( |20] l below. 

- S'o € M", the start symbol. 

MGG as a formal grammar (and as a model of computation) will be limited to 
"affine" mappings, which correspond to non-terminal symbol replacement (concatena- 
tion is addressed by the end of the section). A grammar rule p has the general form 

Si = p(5o) = (5*0, (j.uj) ^ {Sq,(7 ■ uj ■ o-*) , (20) 

where 5o,5i e M", cr is a permutation matrix (a single 1 per row and column; see 
Sec. [3]l and u e M" is such that there exists a e M" with zero imaginary part that 
fulfills 

oj = a + i{a + l), (21) 

being 1 e M" the nxn matrix filled up with ones (by eq. ( fTSl l the imaginary part must 
be zero). These elements have been introduced in ifTTl . so-called swaps. 

The way operations in eqs. (|20| ) and ( 1211 1 are carried out need to be clarified regard- 
ing the A mappings, which are part of the elements of M" according to ( fT9l ): 

1 . Multiplication by scalars is defined by 

aC= {aL^iaK,\{C)) , a = 0,1. (22) 

2. The addition Ci + £2 should respect the mapping A so it is allowed just in case 
A£,(/:i) = A£,(/:2): 

£i+£2 = {{Li + L2) + i{Ki + K2),\{Ci)) = (23) 
= {{Li + L2) + i{Ki + K2),X{C2))- 

3. Recall from Sec. [3] that permutation matrices define a permutation on the indices 
of the elements of M", cr(l 2 . . . n) = (a(l) ct(2) . . . a{n)). For £ e M" the 
matrix multiplication is given by 

a. C = a ■ C ■ (J^ = a ■ [L + iK,\) ■ (T^ = 

^{<j-L-a'+ta-K-a\X (£.o>o))) . (24) 

The following proposition now follows easily: 

Proposition 2. The formal grammar (5 is closed under the operations given by the 
production rules as defined in eq. (120b . 

Proof 

□The proof can almost be derived from the fact that M" is closed under the operations 
•, +, defined in eqs. ^ and (O: 



1. a = 0, 1, £ e M" =^ a- CeWr holds because e M". 

2. C E M" , a permutation a ■ C ■ a* E M" because it is just a relabeling. 

3. £i, £2 e M" =^ Li + L2 e M" because addition respects labeling. 

A little of extra work is needed for the scalar product: 

{Ci,C2) = {Li+iKi,L2 + iK2) = {Li + iKi) {1 + K2 + i{l + L2)) = 
= [Li + L1K2 + K + K1L2) +i[Ki+ K1K2 +Li+ L1L2) = 
— a + i(3. 

so (£1, £2) G M" if a/3 = 0. After some manipulations, we obtain that 

a/3 = (Li+i^i)(l + L2+i^2). (25) 

To see that eq. (1211 1 guarantees a(3 = simply substitute it into eq. (IZST l. ■ 

Concatenation as an operation is simpler than non-terminal symbol replacement. In 
essence it consists in passing from M™ to M" with m < n. Assuming that T is not 
enlarged|3 all we have to do is to add n — m rows and columns to the matrices that 
define the production rules (filling them with zeros) and consider these new rows and 
columns in the permutation matrices. 

The production rules in 6 have the general form M N, which is a particular 

case of eq. ( fTSI ). Taking concatenation into account, the general form that appears in eq. 
(fTSl) is recovered. 

5 Model of Computation 

In this section we shall start by informally providing some semantics to the elements and 
operations of the formal grammar 6 of Sec.|4l elements of M" (permutation matrices 
in particular), addition, matrix multiplication, the set T, the mapping A and the element 
u) as introduced in eq. (l2Tl l. After this, the nodeless MGG model of computation is 
defined. The section is closed with a short summary on some MGG submodels and 
supermodels of computation. 

Let C ^ L + iK E M". The Boolean matrix L can be seen as the adjacency matrix 
of some simple digraph g. This matrix is completed with K, which we will interpret as 
those edges that can not belong to g0 

A meaning for T could be that of (node) labels. Closely related is the mapping A 
that assigns a label to each node. This is why it is just defined for the elements in the 
main diagonal. It is natural to impose A(i) — ^{K). The operation defined by eq. (|24] | 
can be understood as a node relabeling. Refer to Sec. [3] 

The definition of uj given in eq. (I2TI) simply states that we have to do inverse op- 
erations on L and K (refer to Sec. |2]i. The permutation matrix cr allows uj to act on 
rearrangements of elements equally labeled. 

If T is modified then we would be redefining the grammar rather than concatenating symbols. 
' ' The importance of K stems from the fact that we will be studying graph dynamics. We need 
to specify that one edge is not present in a given graph for example because it is going to be 
added by some production. 



Notice that no restrictions can be set on the applicabihty of production rules ac- 
cording to eq. ( |20] l. i.e. swaps can always be applied. We shall introduce a new means 
to define productions (as introduced in MGG; see Sec.|2]l using the Boolean operations 
and (pointwise multiplication) and or (close to matrix addition) and the erasing and ad- 
dition matrices. This will be our model of computation, to be known as nodeless MGG. 
Productions can be understood as swaps with restrictions: productions are one of the 
possible ways to set constraints on the applicability of swapsF^ 

The model of computation associated to MGG is the 5-tuple (CJ, T,V,t, Ho) where 

- g = {£ = (L, if, A) I A : V{C) T}, being V{C) the set of vertices. The for- 
mal notation C — LV iK will be used. 

- T is the finite set of labels. 

- V is the finite set of compatible productions. 

- T is the transition function that modifies the state 

T-.gxv — >g 

(H,p)^{H,p)^piH)^H' 

- Hq is the initial system state (see Sec.|2]i. 

The model is deterministic if r is a single-valued function and non-deterministic 
if it is a multivalued function. The default halting condition is "no production can be 
applied". 

The two basic operations are again non-terminal symbol replacement and concate- 
nation, on which we touch in the following paragraphs. 

Concatenation in MGG is defined for productions, which is just the sequential ap- 
plication of two or more productions. It will be represented by a semicolon and should 
be read from right to left (like standard composition of functions). Refer to Sec. |2]for 
more details. 

Graph rewriting substitutes the occurrence of some graph (known as pattern graph 
or production LHS, L) in the host graph (H) by the corresponding replacement graph 
(also known as production RHS, R). The specification of the operation is done through 
a so-called grammar rule, production rule or just production, and is represented as a 
graph morphismp : L R. The operation itself is known as a derivation. Refer again 
to Sec. |2] Graph rewriting plays in MGG the role of non-terminal symbol replacement 
in formal grammars. 

We can associate the element p = e V ir to any production p, being e and r their 
erasing and addition matrices, respectively, which have been introduced in Sec. |2] No- 
tice that as e and r are disjoint, the or and the matrix addition coincide, so we can think 
of p as an element of M". 

We shall see in Sec.|7]how swaps and productions as introduced here are related. 
Grammar state will be for us the next production to apply and where it is to be applied (match 
of the production in the host graph). In some sense, the state of the productions. With system 
state we shall refer to the actual host graph (the state of the object under study). The term state 
alone will mean the grammar state plus the system state. 



From the element p — e V ir we can define a swap uj = P{p) ~ P{e V ir) = 
er V i{e V r). Again, as er and e V r are disjoint we can write 

u; = {e + l)(r + 1) + i{e + r) = {e + r + 1) + i{e + r). (26) 

Notice that ui satisfies eq. dSTT i. hence the name. Moreover, as was proved in [18|, i? = 
{L, P{p))- Taking relabeUng into account we can write an affine production as 

R\JiQ^ n^p{C) = (7 ■ {a ■ {L\J iK) ■ a^erV i{ey r)) ■ ct* = 

= (LW iK,c7 - [erW i{eW r)] - a*) , (27) 

which is just eq. ( fTSl ) including the nihil parts K and Q. 

The application of the production p to a host graph G to derive the system state H 
is given by: 

H=p{G) = {G,a.P{p)). (28) 

The matching - see Chap. 6 in [ 16,| - is performed by the relabeling a. Notice the 
non-determinism of this step and its NP-completeness. Compare with eq. ( |20] |. 
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Fig. 8. Sample Permutation plus Host Graph 



Example. DLet's consider the production p and the host graph G depicted in Fig. [S] 
The nihil terms K and Q have been omitted to ease exposition. All nodes are assumed 
to be of the same type so neither the numbers nor the colors in the figure should be 
understood as labels. Numbers are used for referencing purposes only. 

Production p deletes edge (1, 2) and adds a new edge (2, 1). It also keeps edges 
(1, 1) and (1, 3), demanding their existence in the host graph. There are initially four 
possible identifications of the LHS L in G, which correspond to the following map- 
pings: (1, 2, 3) I — > (1, 2, 3), (1, 3, 2), (3, 2, 1) and (3, 1, 2). The permutation matrices 
are 





'1 00' 




'1 0' 




'0 r 




'0 1" 




1 


CT2 = 


1 




1 


0-4 = 


1 




.0 1. 




.0 1 0. 




.1 0. 




.0 1 0. 



with associated productions 

q,{L) = {L,a,.p) - (i, (T, • p • a*), i G {1,2,3,4}. 

In principle, all four are valid productions but not all can be applied to G. Production 
P2 would try to add edge (3,1) which already exists in G. The same problem appears 
with p4 but this time with edge (1,3). 



Apart from illustrating how relabeling works on productions, this example tries to 
show how it will be of help in characterizing non-determinism in Sec.|6] ■ 

The main difference between what is exposed in this section and Sec. |2] is that 
nodeless MGG does not act on nodes but just on edges. This has the advantage of 
avoiding dangling edges and all compatibility issues derived. 

The section ends brushing over submodels (by setting further constraints on MGG) 
and supermodels of computation (by relaxing the axioms of nodeless MGG). An initial 
proposal is: 

1. On the matching: 

- For submodels, a subisomorphisn^ instead of an injective morphisms - as in 
Def.|9]- can be demanded. 

- As a supermodel, the injectivity of the morphism may be relaxed. 

2. On the productions: 

- For submodels, relabeling can be forbidden. Other constraints on the operations 
can be set by using application conditions (see 1 16 1, Chaps. 8 and 9). 

- Nodeless MGG can be extended by allowing graph constraints and application 
conditions (see [16]). Also, new operations on graphs can be allowed such as 
negation of graphs or we can permit general adjacency matrices in the multi- 
plication instead of just permutational matrices (see Prop.|3]below). 

3. On the underlying space: 

- For submodels, instead of considering simple digraphs we may consider undi- 
rected graphs or even subsets such as the group of permutations (digraphs with 
single incoming and outgoing edges). 

- Other structures more general than simple digraphs can be allowed. Examples 
are multidigraphs and hypergraphs. 

Limiting matrix multiplication to permutation matrices is a big restriction, at least 
concerning the amount of matrices left out. The following proposition shows the orders 
as functions on the number of nodes: 

Proposition 3. Assuming that all nodes are of the same type, the orders of the number 
of permutation matrices per swap (n) and per production Pp (n) are: 




PAn) = = - e e-l"-^ (30) 

' #(prods) 3" 3" 

where n is the number of nodes. 

Proof 

□Recall that logf,(a;) = j ■ The Stirling formula for the factorial (the number of 
permutations) has been used in eqs. ( |29] l and (|30] |. For the denominator, notice that a 



A subisomorphism can be defined as an isomorphism between the LHS of a production and 
the part of the host graph in which it is identified. 



swap is fixed by its real or imaginary parts, so there are as many as adjacency matrices 
for graphs with n nodes: 2" . To prove eq. ( |29] l just take the logarithms to derive 

^ + ^_2_„ + „log(n)-log(2)n2. (31) 

All we have to do to check eq. (l30l l is to count the number of different productions 
that can be defined for graphs with n nodes. One possibility is to establish a link be- 
tween productions and the standard Sierpinski gasket^To this end, write the adjacency 
matrix of the graph g of n nodes as a single column (the second sitting right after the 
first, the third after the second and so on). Recall that a production p can be written as 
p = eV ir, with er — 0. Hence, productions are the zeros of the and function. See 
Fig. |9] which represents well defined productions of the form p ^ eV ir for matrices 
with two nodes. 
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Fig. 9. Weel-Defined Productions 



The Lucas correspondence theorem (see Q and also ||23l ) proves that the set of 
zeros is the Sierpinski gasket as it can be used to compute the binomial coefficient (^) 
mod 2 with bitwise operations: L A K. This tells us that the parity of the function (^) 
(this is what the function mod 2 does) is the same as that of L A K. In our case L is 
the abscissa. The negation of L just reverts the order (it is a symmetry) and does not 
change the shape of the figure. 

Counting the number of elements is not difficult. Notice that in Fig.|9] the big Sier- 
pinsky gasket is made of a number of copies of the smaller one. The small Sierpinsky 
gasket has 9 elements and it appears 3" times so the total amount of well defined 
productions is 3" . The main difference with respect to eq. dSTT l is the coefficient of 
—n^ which would be log(3) this time. ■ 



For more on the relationship between the Sierpinski gasket and swaps and productions, please 
refer to L17J . 



6 Determinism and Non-Determinism in MGG 



There are two types (sources) of non-determinism associated to the grammar state: 
selecting the next production to apply (let's call it election non-determinism) and finding 
the place in the host graph where the production will be applied (that we shall name 
allocation non-determinism). The transition function is not uniquely determined either 
because more than one production can be applied or because the oracle returns more 
than one place where the production can be applied. 

One easy way to mitigate or to even remove non-determinism is to use control nodes 
that indicate what production to apply or where it should be applied. Examples of these 
control nodes are the Si that appear in the productions of Sec.|8] Figs. [l4l[T5] and [T6l 
It is also possible that, for certain purposes, which production is applied or where it 
is applied becomes irrelevant^ An example of this behavior is the simulation of BCs 
given in Sec. |9] 



Li Ri La R2 L3 R3 




Fig. 10. Productions AppUcation (Derivation: Evolution of a System) 



If a sequence of productions s = pn', ■ ■ ■ ; pi (see Fig. [TOli is considered instead of 
a single production, allocation non-determinism can be graded. There is a first partial 
level where only productions are taken into account (ignoring the initial and intermedi- 
ate system states) and a second complete level if the host graphs are considered. Let's 
call them horizontal non-determinism and vertical non-determinism, respectively. 
Remark. DThe names stem from the way the identification of elements is performed 
according to the representation in Fig. [10] If the system states are not taken into account, 
the identification of nodes is horizontal (in productions pi). If the system states are 
considered, the identifications are given by the matching mappings rrii. ■ 

Nodes with the same label are potentially interchangeable so non-determinism in 
a single graph is equivalent to the Cartesian product of the corresponding permutation 
groups (one per type). For a given simple digraph L, let's denote this group by S{L). 
Their elements are represented by permutation matrices as explained in Sec. |3] 

An affine production p : L ^ R identifies nodes between L and i? in a unique way. 
The traslational part of the production that acts on edges does not affect the permutation 
group of the RHS. The rotational part a does not affect the permutation group of the 
RHS either: 

S{R) = {a ■ cr,; I a^ € S{L)} = S{L). 



This is related to confluence, not addressed in the present contribution. See (2). 



Hence, productions in nodeless MGcTl do not modify the permutation group associated 
to graphs. 

Horizontal non-determinism in sequences can be handled by letting the permuta- 
tional part of productions vary in the corresponding permutation group. Notice how- 
ever that not all variations are possible if applicability is to be kept (see the example of 
Fig.©. 
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Fig. 11. Sample Productions 



Example. DLet's consider the productions pi, p2 and p3 depicted in Fig. (TT] being 
1, 2 and 3 nodes of the same type. The group of permutations of three elements is 
5*3 = {id, (1 2), (1 3), (2 3), (1 2 3), (3 2 1)}. Their associated affine productions are 

q^m) = {L„a.p,) = {L„a-p, • a*) , ie {1,2,3}, 

with <T G 5*3. The sequence s = p^; p2; pi has associated affine sequence 

s' = q3;q2;qi = (((il,cr^Pl),cr^P2),Cr^P3>- 
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Table 1. Possible Relabelings for Sequence s = p2',Pi 



Table [T] includes all the permutations for the subsequence P2',Pi- A 1 means that 
applicability is kept and a means that applicability is not kept. Only the diagonal 
keeps applicability. The problem in all cases is that some already existing edge would 
be added by q2 . 



Productions would change the permutation groups if they were allowed to add or remove 
nodes. 
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Table 2. Possible Relabelings for Sequence s — ps; P2t Pi 



Table 12] summarizes all permutations for the sequence p^]p2]pi- Just the permu- 
tations that keep applicability for P2;pi has been considered (indexed by columns). 
Again, a 1 means that applicability is kept and a means that applicability is not kept. 
The problem in all cases is that some non-existent edge would be deleted by 53. 

In this example, as there are many possible relabelings, we have that the sequence is 
horizontally non-deterministic. Figure[T2lrepresents the effect of applying the sequence 
■s = 93^ ^ (I2 ^' ; to the initial host graph Li. ■ 




Fig. 12. Image of the Sequence of Three Productions 



Proposition 4 (Horizontal Determinism Characterization). Let^Sn = Pn',^- ■ ',Pi be 
a compatible sequence of productions and define the operator C(s„) = C^(s„) V 
C~{sn) by 



C+(.„) = V (i?JV;(eSr^) ViJ A (e^r^)] (32) 

1 — 1 ^ ' 




being = a^.Xi^ ai ■ Xi ■ cr*, Xi = ei,ri, Li, Ri, Ki, Qi, and 




There are three possibilities: 



1. The sequence is not applicable if for some i there exists no permutation such that 

CiSn) =0. 

2. The sequence is horizontally deterministic if for every i there exists a single permu- 
tation such that C(s„) = 0. 

3. The sequence is horizontally non-deterministic if for some i there exists more than 
one possible permutation a such that C(s„) ~ 0. 

Proof (sketch) 

□Theorem. 6.4.4 in fT6l characterizes applicabihty as equivalent to compatibility plus 
coherence. Applicability in a single place is determinism and applicability in several 
places is non-determinism. Compatibility is always fulfilled by nodeless MGG and nev- 
ertheless a hypothesis of the proposition, so it just remains to study coherence. 

Equations (|32] | and ( |33] ) for productions instead of affine productions characterizes 
coherence as proved in Th. 2, Sec. 5 in |18|, or in Theorems 4.3.5 and 4.4.3 in fT6]. 
To finish the proof, simply add permutations to the productions (to tackle relabeling) 
transforming them into affine productions. ■ 

We shall end this section by pointing out that vertical non-determinism can be ad- 
dressed as horizontal non-determinism. The only difference is that matrices (and thus 
permutation groups) will be larger 

7 From Boolean Algebra to a Matrix Algebra 

Matrix Graph Grammars use Boolean algebra mainly because production actions and 
graphs are characterized through Boolean matrices and Boolean operations (and and 
or). As swaps are linear transformations, it seems interesting to move to algebra and try 
to identify the linear operations (if any). The first thing to do is to recover productions 
from swaps. 

Recall that a swap can be applied to any host graph whereas for a production we 
need to find some elements in the host graph (L) and guarantee that some others {K) 
are not present: a production is a swap together with some restrictions. 

Proposition 5. Let p = [C, e + ir) = (L + iK, e + ir) be as in previous sections a 
production and uj — P{p) — (e + r + 1) + j(e + r) its associated swap, applied to the 
host graph G. The image of the production -TZ = p{C) - is given by: 

H = p{G) — a. UJ + G (graph transformation) 

QR = (a.K)G — {a.L){G + 1) = (application conditions) 

Proof 

□According to Prop. |4] the application can be deterministic, non-deterministic or im- 
possible, depending on a. In 1161 , Chap. 6, it is proved that a direct derivation can be 
defined (equiv., a production can be applied) if the production is well-defined (compat- 
ible) and its LHS is found in the host graph (and the nihilation matrix is not present in 
the host graph). 

The first application condition QR = is compatibility. If we limit ourselves to 
nodeless MGGs, this condition will be always fulfilled. The second condition demands 



the non-existence of K in G. It is equivalent to {a.K){G + 1) = a.K. The third 
condition demands the existence of L in G as it is equivalent to {a.L)G = a.L. ■ 
Remark. DThe graph transformation part is linear while the application condition part 
is non-linear. Application conditions and their generalizations (so-called graph restric- 
tions) are studied in |fT9l . and also in Chaps. 8 and 9 in lfT6l . ■ 
The next step is to characterize the application of a sequence with a finite number 
of productions, s = Pn\ ■ ■ - Wi- There will be again two parts, one linear with the 
actions and one non-linear with the restrictions (see Th.[T]below). In particular we shall 
guarantee in the propositions that follow that the sequence is compatible, coherent and 
that the initial digraph is contained in G + i{G + 1), being as always G the host graph. 

Proposition 6. With notation and hypothesis as in Prop. |4] the same conclusions hold 
if the operator C{s) = {s)-\'iC~ {s) that assigns one element of MJ^ to the sequence 
s is considered: 

n / ■ T 

-w—r / " -J"! 

c+{s) = 1 + n ( ^j.Yi [(^- + 1) + ^ + 1) + 

n j-1 \ 

+ L^R^ V (eS + 1) A « + 1) j , (34) 



C-{s) = l + l[ f [{r: + 1) e^] + KJ A [« + 1) r^] + 

n j-1 \ 

+ KJQ'^V^ {rl + 1) A (e^ + 1) j , (35) 

being 

1f{x,v) 
to 

yFix,y) 

to 

Proof 

□ All we have to do to apply Prop. |4] is to prove the equivalence between equa- 
tions ( [32l ) and ( [33l ) and equations ( [34l ) and ([35j, respectively. Also, we have to check 
sameness of A' and V' in Prop.|4]with A and V in this Proposition. To this end, simply 
use the following identities from propositional logics: 

n n 

Xj Axk — Xj • Xfe, V ^ n ^ ^■J'-' ' X = 1 + X. 

J = l J = l 

Both cases G+ and C~ are almost equal. ■ 
A similar reasoning applies to the calculation of the initial digraph of a coherent 
sequence and its compatibility in the following two propositions, as cmomented above, 
they will be used to characterize determinism of a sequence of productions. 



y=to \ x=y / 



1+ n 1+ n ^(^-y) 

y=ta \ x=to / 



Proposition 7. Let s — pn', - ■ ■ ;pi be a coherent sequence with notation as above. 
Then, the initial digraph is given by: 

n 

Mis) - Mcis) + lAMs) = Y [« + l)L; + t (e: + 1) (T^ + 1) K^] . (36) 

Proof 

□ ■ 

Proposition 8. Let s = Pn', ■ ■ ■ ] Pi be a sequence made up of compatible productions. 
Then, s is compatible ifW{s) — 0, w/iercj 

n 

= Y [(e" + 1) [rZ + 1) Mc{s,)Mm{s,)\ . (37) 

Proof 

□ ■ 

Previous propositions allow us to characterize determinism - notice that determin- 
ism extends applicability as introduced in lfT6l . Chap. 1 -. According to Th. 6.4.4 in lfT6l 
all we need is compatibility and coherence of the productions plus finding the initial di- 
graph in G + i{G + 1). 

Theorem 1. Let s — pn', ■ ■ - 'jPi be a sequence of productions with associated swaps 
ujj,j = 1, . . . , n. The image H of the sequence when applied to the host graph G is 
given by: 

H = s{G) = X]J=i '^j- + ^ (graph transformation) 

W{s) — C{s) — Mi\f{s)G — Mc{s){G + 1) = (application conditions), 

being M{s) the initial digraph, W{s) the compatibility conditions andC{s) the coher- 
ence conditions. 

Proof 

□ ■ 

If we stick to nodeless MGG then the formulas get simpler as -I- 1 = = and 
Ki — ri,i E {1, . . . , n}. Again, the graph transformation part is linear while the part 
of the application conditions is non-linear. 

8 MGG and Turing Machines 

In this section the relationship between Matrix Graph Grammars and Turing machines 
(TM) is studied. Two standard introductions to TMs are |7| and 121]. We will see how 
to simulate a TM using nodeless MGG. The simulation of TMs using MGG is easy 
except that the tape in TMs is unbounded. 

The letter W has been chosen because compatibility can be understood as well-formedness of 
the sequence, i.e. the sequence does not define an operation that produces something which is 
not a simple digraph: the space is closed. 



According to ETI . a Turing machine can be formally defined as a finite-state ma- 
chine with a memory medium. It is the 7-tuple 

M ^ {Q,r,S,T,qQ,qa,qr) , (38) 

where Q is the set of possible states, F is the set of tape symbols (a special one named 
blank symbol belongs to U G F), U C P\{U} is the set of input symbols, r : 
Q X F Q X F X {LS, RS} is the (partial) transition function^ qn is the initial state, 
qa is the accept state and qr is the reject state. All sets under consideration are finite, 
except for the length of the tape. The blank symbol is the only one allowed to occur 
infinitely many times in the tape during computation. 

The set of states Q will be modeled with labeled nodes as well as the sets of symbols 
F and S and the initial, accepting and rejecting states qo, qa and qr- Productions will be 
used to model r. There will be as many productions as rows in the state table of the TM 
(see table [3]for an example). State tables are a common means to represent transition 
functions for TMs. 

There are several remarks at hand. First of all, finding a match for a production can 
be done efficiently. This is so because each production has a state node that works as a 
flag indicating the place(s) where the production can be applied. 

Second, it is straightforward to simulate non-determinism. Election non-determinism 
is non-determinism as normally defined in TMs. Allocation non-determinism happens 
in TMs when (in a non-deterministic TM) the same row of the state table can be applied 
to two (or more) different paths inside the computation tree. 




Fig. 13. Simulation of an Infinite Tape: Tape Management Productions, pd Adds a Cell 
to the Left and pcr Adds a Cell to the Right 

Third, the tape of any TM has unbounded capacity. On the MGG side, we want to 
stick to finite simple digraphs. Two productions (see Fig. [T3] ) will be responsible for 
tape enlargement just in case it is necessary!^ The idea is to use the labeled node LC to 
mark the leftmost cell and RC the rightmost one. Notice however that nodeless MGG 
does not allow addition nor deletion of nodes. 

The point here is that a TM is a uniform model of computation while nodeless MGG 
is not. There are two possibilities: 

LS stands for left shift and RS for right shift. At times the set {LS, RS, Nmov} is considered, 
where Nmov allows the machine to stay in the same cell. This variation does not increase the 
machine's computational power. 

In Fig.[T3] LC, RC and U are labels but tc is not: it just points out what nodes are identified 
by the mapping. For an explanation of the meaning of the labels, see the example below. 



- We just need to add nodes. Deletion of nodes is not necessary for TMs. Therefore 
we may keep the main property of nodeless MGGs (no compatibiHty issues) if we 
allow the grammar to add nodes to the tape. 

- We may use a non-uniform model of computation and have one nodeless MGG per 
number of nodes. The only problem with non-uniformity is that various nodeless 
MGG may have completely dissimilar structure. Precisely, this is avoided with pro- 
ductions Pel and PcrCJ Observe also that addition of nodes have been used just to 
simulate the infinite tape, but not for modeling the behavior of the TM. 

The rest of the section is devoted to an example that models a copy subroutine, taken 
from ||25l . Its behavior is summarized in table [3] This TM replicates any sequence of 
ones inserting a zero in the middle (the head should be positioned in any of the I's that 
make up the string). For example, it transforms 0000110 i — > 0110110. 
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Table 3. Copy Subroutine State Table 



In table [51 the first column is the equivalent MGG production, which can be found 
in Figs. [14] and [15] The rest of the columns specify the TM behavior. Column init state 
is the initial state of the TM, the third column is the tape symbol in the cell being read, 
print op is the print operation to be carried out in the tape cell under consideration^ 
head motion indicates where the head should move tcj^ und final state is the state as- 
sumed by the TM (H is the halting state) which becomes the initial state of the TM 
for the next operation. For example, the second row (pn) says "if your state is si and 
there is a 1 (one) in the tape cell, then print a (zero), move to the cell on your left and 
assume state S2". 

^' An example of non-uniform model of computation is BC. Uniformity conditions have to be 
imposed to BC because otherwise even non-computable functions can be computed by small 
circuits. See V22i for the details. 

Nop stands for no operation, PO is "print zero" and PI is "print one". 

Nmov stands for no movement, HL for move head left (equiv., move tape right) and HR for 
move head right (equiv., move tape left). 




Fig. 14. First Six MGG Productions for TM Simulation. Refer to Table[3]for Their TM 
Counterparts 



One production simulates each action of the TM. There are ten, drawn in Figs. [141 
and[T5] A brief explanation follows: 

- Blue squared nodes represent tape cells. Either a or a 1 have to be written on 
them. As in this example they can not be blank, they should be initialized to (in 
Pel and Per on Fig.[T3]we should substitute U by 0). 

- Node Si stands for the ith state. It is also used to mark which cell the head points 
to. 
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Fig. 15. Four More MGG Productions for TM Simulation. Refer to Table [3] for Their 
TM Counterparts 



The set of operations that copies the ones in the word 0110 is Sc — Pio', P5o \ Pio', Pn] 
P30 ; P31 ; Pcz ; P20 ; Pii ; P50 ; P51 ; P40 ; P30 ; P20 ; Pc; ; P21 ; Pi 1 • This is the sequence that trans- 
forms 0110 into 110110. As commented in the introduction, MGG techniques are 
available for TMs. For example, it is possible to calculate Sc - at least we can guess 



what productions are necessary and the number of times that each one appears - using 
reachability (see lfT4l or Chap. 10 in |fT6l ). 




Fig. 16. Initial and Final States for the Copy Subroutine 



9 MGG and Boolean Circuits 

In this section we will simulate Boolean circuits (BC) using nodeless MGG. A standard 
reference on BCs is ll22]| . 

Recall that a BC is a simple digraph. We need only pay attention to the evolution of 
the circuit, which is equivalent to encoding the representation of the Boolean operations 
permitted in the BC. In |[22l . the first thing to do when defining a BC is to fix the Boolean 
operations allowed. In the present contribution we will restrict ourselves to the Boolean 
operations and, or and not. The same techniques that will be described apply to any 
Boolean operation. 

There are many ways to encode Boolean operations. We represent the gates with 
yellow circles labeled with the corresponding Boolean operation. The variables are un- 
labeled circles in blue. Notice that Xi and y are not labels but a means to visually repre- 
sent which nodes on the LHS are mapped to which nodes on the RHS. On the contrary, 
V, A, -■, and 1 are labels. 

Gates labeled with not can only have a single input and a single output node. Gates 
of types or and and must have two or more inputs and a single output. An unlabeled 
node is an input variable of some gate if there is an edge from the node to the gate. It 
will be an output node if there is an edge from the gate to the node. For example, y is 
an output node in ^(0) in Fig.[T7]and x is an input node. 

Input and output nodes can be in three states: either no value has been assigned yet 
(which will be indicated by a self-loop) or it has value or value 1 (represented with an 
edge from a properly labeled node). See Figs. fTTlfTS] and [T9l for some examples. 

The not operation (see Fig. [TtI i is almost self-explanatory. It is not mandatory to 
delete the input and output edges (to the not node) as the rule would not be applicable 
to the same elements in the host graph because of the missing self-loop in node y. We 
do it as a visual aid. The value of the input variable is kept (the edge is not deleted) 
because this node could well be the input of another gate. 

We will not limit the number of input nodes to or and and gates. To ease operations, 
we will demand an ordering in their input nodes. One way to achieve this is to introduce 
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Fig. 17. The not Operation 



a node ie (start-end) to mark the first input node and the last input node for some gate 
B (= and, or). Every gate of type and and or will have an edge incident to se (each 
gate B has its own ordering, but a single node may belong to the input nodes of several 
gates). One edge from input node xi to input node X2 will mean that xi goes before X2 
considered as an input node for gate B. See Figs. [TSjandfTglfor some examples. 
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Fig. 18. The or Operation 



The or operation is represented in Fig. [18] Production V(l) forces the result to be 
1 as soon as any input node has this value. Grammar rule V<;(0) decreases the number 
of input nodes if both are zero. Notice that Vs(0) is applied in strict order in the input 
nodes. Rule Ve(0) assigns a to the output gate when only the starting and the ending 
input nodes are left. 

As in the not gate, values assigned to the input nodes are not removed because 
a single node can be an input for several gates. The same remark applies to the and 
operation, which is described next. 
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Fig. 19. The and Operation 



The productions that encode the and operation are depicted in Fig. [19] Their inter- 
pretation is very similar to that of the or operation. 

The productions that appear in Figs. [TT] [18] and [19] simulate Boolean circuits non- 
deterministically. This non-determinism guarantees that the operations are performed 
optimally: only those nodes necessary to evaluate the BC will be calculated and in the 
precise order. 

The complexity of a BC is measured in terms of the size (number of gates) and the 
depth (length of the longest directed path). There are theorems that relate lower bounds 
on BCs with lower bounds on TMs (see |22 |). The non-deterministic encoding of BCs 
given in this section should not affect this complexity measures. Nevertheless, it should 
be clear how to transform non-determinism into determinism using some control nodes. 

10 Conclusions and Future Work 

In the present contribution we have introduced MGG as a formal grammar as well as 
a model of computation. We have also seen that non-determinism in MGG can be ap- 
proached through so-called relabeling. It has also been proved that nodeless MGG is ca- 
pable of simulating deterministic and non-deterministic Turing machines and Boolean 
Circuits. As a side effect, all MGG techniques are at our disposal to tackle different 
problems in Turing machines and Boolean Circuits. As commented in Sec. [T] we can 
(partially) address coherence, congruence, initial state characterization, sequential in- 
dependence, reachability and some others. Something similar but for Petri nets is done 
in [141 or in [16J, Chap. 10. However, we have not gone into these topics in depth. The 
theory developed so far can be applied straightforwardly in some cases, while in others 
some further research is needed. 

The use of MGG opens the door to the introduction of analytical and algebraic 
techniques over finite fields. We are particularly interested in representation theory and 
abstract harmonic analysis. 

Apparently, reachability and the state equation appear to be helpful concepts to ad- 
dress complexity problems. In f\M . the reason why the state equation is a necessary but 
not a sufficient condition is pointed out. It seems natural to move from linear algebra 
(state equation) to linear programming (positivity restrictions on variables). 
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